Properties of the random seed input to Bell tests 
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Device independent protocols rely on the violation of Bell inequalities to certify properties of 
the resources available. The violation of the inequalities are meaningless without a few well-known 
assumptions. One of these is measurement independence, the property that the source of the states 
measured in an inequality is uncorrelated from the measurements selected. Since this assumption 
cannot be confirmed, we consider the consequences of relaxing it and find that the definition chosen 
is critically important to the observed behavior. Considering a definition that is a bound on the 
min-entropy of the measurement settings, we find lower bounds on the min-entropy of the seed used 
to choose the inputs required to deduce any quantum or non-local behavior from a Bell inequality 
violation. These bounds are significantly more restrictive than the ones obtained by endowing 
the seed with the further structure of a Santha-Vazirani source. We also outline a procedure for 
finding tight bounds and study the set of probabilities that can result from relaxing measurement 
dependence. 

PACS numbers: 03.67.-a,03.65.Ta,03.65.Ud 



I. INTRODUCTION 

The violation of Bell inequalities can be used to certify 
important quantum information properties in a black- 
box scenario under minimal assumptions. This idea of 
"device-independent" certification started in the context 
of quantum key distribution, where the violation of Bell 
inequalities bounds the information leaked to the eaves- 
dropper [1-3 ; and it has been extended to various other 
tasks, notably state certification [UHllS], measurement 
certification [6 , and private randomness expansion [7-9 . 
Ultimately, this stems from the fact that the violation 
of Bell inequalities certifies the presence of a quantifiable 
amount of intrinsic randomness: indeed, a contrario^ if 
the outcomes were predictable, one could have predicted 
them in advance and the measurement could consist of 
reading from a pre-existing list. This is exactly what the 
violation of Bell inequality certifies as impossible. 

Two assumptions are left in device-independent certi- 
fication. The first is no- signaling: the choice of the mea- 
surement setting of one party should not be known to the 
measurement boxes of the other parties before they pro- 
duce their outcome. This can be guaranteed ultimately 
by ensuring space-like separation, although one may also 
trust a weaker demonstration of separation, as for in- 
stance in [7 . The second assumption is measurement 
independence: the information A contained in the boxes 
in each run should be uncorrelated from the choice of the 
settings in that run. So far, no way of checking measure- 
ment independence is known in a black-box scenario: the 
best one can do is to buy the source of A and the devices 
that choose the settings from different providers, who 
are believed not to be conspiring together. Alternatively, 
one can partly give up the black-box scenario, character- 
ize the devices and be confident that the relevant degrees 
of freedom are uncorrelated. 

It is clear that no-signaling and measurement inde- 
pendence cannot be arbitrarily relaxed: if any amount 



of signaling is allowed, or if arbitary correlation is ad- 
mitted between source and settings, the violation of a 
Bell inequality can be obtained with purely classical re- 
sources A, so there is no hope to conclude that A con- 
tains intrinsic randomness. However, with the aim of 
reducing the assumptions of device-independent certifica- 
tion to their bare minimum, one can partially relax no- 
signaling and measurement independence, and ask how 
much information must be signaled and how much mea- 
surement dependence must be allowed for a Bell test to 
become irrelevant [10 . In this paper, we focus on the 
latter question, the study of partial measurement depen- 
dence (sometimes called reduced measurement indepen- 
dence or reduced "free will"), which has been the object 
of a few recent studies [TTHI4] . 



II. MEASUREMENT DEPENDENCE AND ITS 
BASIC CONSEQUENCES 

A. Measurement independence 

For the sake of this introduction, we consider a bi- 
partite Bell scenario. Operationally, a Bell experiment 
consists of N apparently identical runs [32 , in each of 
which box A receives input x and outputs a value a, box 
B receives input y and outputs a value b. One can then 
estimate the statistics p{ab\xy). We denote by A the in- 
formation present in the boxes in a given run. 

Measurement independence, the assumption that we 
want to relax, is captured by the condition 



p{X\xy) =p{X) yx,y. 



(1) 



Under this assumption, the observed statistics are mod- 
eled by 



PMi{ab\xy) = p{ab\xyX)p{X)dX 



(2) 



The specific goal of a Bell test is to assess whether 
there is intrinsic randomness in the boxes, that is, in 
the usual terminology, to guarantee that A is not a local 
variable. Mathematically, local variables are defined by 
p{ab\xyX) = p{a\xX)p{b\yX). It is useful to stress that, as 
written, Q contains an additional assumption, namely 
that A itself is chosen independently in each run accord- 
ing to the distribution p{X). Under measurement inde- 
pendence, it can be proved that this is ultimately not a 
restriction for Bell tests, although one has to be careful 
in interpreting statistics from finite samples p!5HT7] . 

Measurement independence cannot be denied in a sys- 
tematic way without undermining the scientific method 
itself (if a clinical trial is to make sense, whether each 
patient receives the drug or the placebo cannot depend 
on the any details of the patients' conditions). However, 
it is certainly possible to question measurement indepen- 
dence in a given setup: the devices that determine the 
inputs X, y may be correlated to the process that deter- 
mines A. The origin of such correlation may be triv- 
ial, like the fiuctuations in power of the city network to 
which all the devices are connected; it may be due to 
lack of attention of the experimentalists, who introduced 
unwanted connections; or it may be strongly conspirato- 
rial, in an adversarial scenario in which the devices come 
from an untrusted provider. In all cases, M does not 
hold, nor does the proof that one can restrict the study 
to independently-chosen A. 
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FIG. 1: (color online) There are many different processes by 
which the information A in devices A and B might become 
correlated with the inputs to the devices x and y, as discussed 
in the text. In this illustration the processes are represented 
by some external pre-existing variable /i that serves to intro- 
duce the correlation. The blue boxes represent the physical 
random number generators used to pick the inputs to the Bell 
test. 

By relaxing condition (fTl), one allows correlations be- 
tween the source A and the choice of the settings x^y. 
Bayes theorem implies that 



causal relation: all is compatible with both A and x^y 
being influenced by a common cause (Fig. [l]). That be- 
ing clarified, our discourse will be mostly phrased in the 
second way (the first way will be used in Section VI). 
We shall then look at measurement dependence as re- 
ducing the probability of certain pairs of settings. In the 
case where the dependence is sufficient to exclude enough 
pairs of settings, unwanted features of local variable mod- 
els may be hidden. This is the same intuition behind 
the power of the detection loophole; in fact, measure- 
ment dependence is even stronger, because it may allow 
to exclude a single pair of settings, whereas the detec- 
tion loophole is local and excludes all pairs of settings 
such that one given setting of (say) Bob is associated to 
unwanted features. This opens a wealth of possibilities 
that we review rapidly next. 



B. Effects of measurement dependence 

The obvious effect of measurement independence is the 
possibility of faking a violation of Bell inequalities. A Bell 
inequality is built on a linear combination of p{ab\xy), 
whose maximal value (called algebraic limit) cannot be 
reached by local variables. If, in each run, one can ex- 
clude some suitable pairs of settings in correlation with 
the content of the boxes A, then it becomes possible to 
reach the algebraic limit while having only local variables 
in the boxes. 

Let us illustrate this point with the most famous Bell 
inequality, that of Clauser, Home, Shimony and Holt 
(CHSH). The inequality reads 



\{aobo) + {aobi) + {aibo) - {aibi)\ < 2 



(4) 



p{X\xy) ^ p{X) 4=^ p{xy\X) ^ p{xy) . 



(3) 



with ax^by G { — 1, +1}. In order to achieve the algebraic 
limit of 4, one should have ao = 6o, ai = 6o, ao = bi and 
ai = —bi. Local deterministic points exist that satisfy 
three out of these four conditions. If one wants to achieve 
the algebraic limit with local variable and measurement 
dependence, a sufficient strategy is the following: in each 
run, A is chosen among the aforementioned local deter- 
ministic points, and the pair of settings corresponding to 
the unwanted condition is never chosen JOl HI] • 

The fact that a sufficient amount of measurement de- 
pendence can lead to the algebraic limit has an intriguing 
consequence for some inequalities. Indeed, in generic in- 
equalities, the algebraic limit may lie even above what 
can be reached with no-signaling correlations. For in- 
stance, the tilted CHSH inequality 

\{aobo) + (aobi) + (aibo) - (aibi) + a{ao)\ < 2+a , (5) 



The first relation could be read as "the output of the 
source is restricted for a given choice of settings", the 
second as "the choice of settings is restricted for a given 
output of the source" . Neither needs to refer to a real 



has an algebraic limit of 4 + a, but no-signaling correla- 
tions can reach only up to 4 if a < 2 [18 . If measurement 
dependence is allowed, to the point that one pair of set- 
tings can be excluded, then one can achieve the algebraic 



limit with a convex mixture of 

A = (+1,-1,-1, +1) together with {x,y)y^OO 

A = (+1,+1,+1,-1) together with {x,y)^01 

A = (+1,-1,+1,+1) together with {x,y)^10 

A = (+1,+1,+1,+1) together with {x,y)^ll 



(6) 



where we denoted a local deterministic point as A = 
(ao,ai,6o,6i). If a Bell test is run with this underly- 
ing strategy, the observed correlations will lie outside 
the no-signalling polytope, i.e. are formally signalling. 
Obviously, this does not mean that measurement depen- 
dence makes it possible to use entanglement to actually 
send a message: in order for (say) Alice to send a mes- 
sage to Bob, she must be able to choose her setting at 
will, which is precisely what measurement dependence 
denies. At any rate, one must be careful when working 
with measurement dependence: the worst case are corre- 
lations that reach the algebraic limit, not the no-signaling 
one (to our knowledge, all the studies of measurement de- 
pendence so far dealt with inequalities for which the two 
limits happen to coincide p!QHl4] ). 

The take-away message of this paragraph is that one 
does not have to reach the extreme case of total mea- 
surement dependence (i.e. A determining x^y uniquely): 
already with some partial amount of measurement depen- 
dence, it becomes impossible to draw any conclusion from 
the violation of a Bell inequality. One of our main result 
will consist in deriving general bounds for this amount 
(Section IV). In order to do that, we need first to de- 
fine suitable figures of merit that quantify the amount 
of measurement dependence. This is the object of the 
following section. 



III. QUANTIFYING MEASUREMENT 
DEPENDENCE 

From now onwards, p{xy\X) will be called the seed of 
the Bell test. Partial measurement dependence means 
that there is some randomness in the seed, and we aim 
at quantifying this amount of randomness. The char- 
acterization of sources of randomness is a well studied 
subject in computer science, so we are going to adapt 
this known material to our problem [33|. Let us start by 
arguing that some attention is indeed required. 



A. Critical choice of a figure of merit 

We illustrate our point with an example. The chained 
inequality is a bipartite Bell inequality with m settings 
for each party and binary outcomes a, 6 for both mea- 
surements on A and 5, which reads 

Im = p{a = b\x = l,y = m)^ ^ p{a ^ b\x, y) 

x,y s.t. 

xe{y,y-\-l} 

< 2m-l. (7) 



It has been used to put stringent bounds on quantum 
theory thanks to the property that, in the limit m ^ oo, 
its algebraic limit /^ = 2m can be reached with mea- 
surements on quantum states [19||20j. 

Out of the m^ possible pairs of settings, 2m are ef- 
fectively used in the inequality. Furthermore, there exist 
local deterministic points that can satisfy 2rri — 1 of these 
conditions. Therefore, in order to verify any conclusion 
based on the chained inequality, it is enough to have an 
amount of measurement dependence that allows the ex- 
clusion of only one pair of settings out of m'^. In the limit 
of large tti, under whichever measure, such a seed is very 
close to a fully random source: for instance, its min- 
entropy per run (defined below) is log(m^ ~ 1)? which 
differs from the fully random value \ogm^ by 0{m~'^). 
This example shows that a seed, which would presum- 
ably be considered as good as it gets in an abstract as- 
sessment, is already catastrophic for the Bell inequality 
under study. Notice that this remark is not in contradic- 
tion with the results of [TT] , which can be seen as proving 
that the chained inequality is pretty robust to measure- 
ment dependence: indeed, in that work, the additional 
Santha-Vazirani assumption was made on the seed, which 
implies that all the pairs of settings are possible in each 
run. Our argument, based on excluding one setting in 
each run, does not apply. 

It is now time to present the definitions we have just 
sketched in their suitable formal setting. We shall con- 
sistently use the word seed to stress that the source of 
randomness we are interested in is the randomness of 
the inputs given the knowledge of the physical process 
A or vice versa, not the randomness possibly present in 
A (which would be the intrinsic randomness of quantum 
origin in the ideal case). 



B. Types of seed 

Here we review rapidly some well-known types of 
sources of randomness for the purpose of this paper, re- 
ferring to [21 J for a comprehensive study. 

Consider a random variable Z in an alphabet Z of size 
d; and let Z = Zi...Zn be an A/'-dit string. In our case, 
Z will represent the settings chosen for the Bell test, i.e. 
Z = (x^y) in a bipartite scenario. Randomness being 
synonymous with unpredictability, a source of random- 
ness (here, the seed) will be characterized by specify- 
ing what one wants to predict and how predictable it is, 
given some prior information A (supposed to be classical 
throughout this paper). One would then say that the 
seed contains randomness if 



(8) 



Pguess(Z|A) := J2 Pi^ = A)Pguess(Z|A = A) < 1 , 
A 

where Pguess(Z|A = A) := maxzp(Z = z|A = A). The 
amount of randomness is quantified by the min-entropy 



Hmin{Z\\) := -logPg„ess(Z|A). 



(9) 



Clearly, H^i^{Z\A) > implies the presence of some 
randomness. To someone who does not have access to A, 
the source will appear to have min-entropy H^i^{Z) = 
— log Pguess(Z) which can only be higher by the data pro- 
cessing inequality [34 . 

The loosest characterization of the seed, i.e. the one 
that requires fewer assumptions, simply puts a bound on 
the min-entropy: 

Definition 1. Min-entropy seed. A random variable Z 
is a k -min-entropy source of randomness with respect to 
another random variable A i/ilfmin(Z|A) > k. 

As soon as A: > 0, the knowledge of A does not deter- 
mine z uniquely. One can add some structure to a min- 
entropy seed. For instance, a /c-min-entropy seed is called 
uniform if i:fmin(Z|A = A) := - logPguess(Z|A = X) > k 
for all values of A. A block min-entropy seed is one for 
which not only the min-entropy of the whole string, but 
the min-entropy of blocks is also lower bounded. These 
notions will not be used in this paper. 

As soon as /c < \og{d^ — 1), the definition of k-min- 
entropy seed is compatible with PguessC^ = z|A = A) = 
for one string z. As hinted in paragraph III A[ the 
possibility that some settings are not chosen is critical for 
seeds of Bell tests. Because of this, one may want to add 
to the properties of the seed the assumption that all the 
d^ strings have non-zero probability. This is equivalent 
to the following type of source: 

Definition 2. Santha-Vazirani seeds. A random variable 
Z is a {pmin^Pmax) S antha- Vazirani source with respect 
to A (where < Pmin ^ ^/d and 1/d < Pmax ^ ^) ^/ 



Pn 



<p{Zi\X,Zi,...,Zi-i) <Pmax Vz 



(10) 



If Zi is a bit, Pmin = 1 — Pmax is usually written 6 [22] . 
Some of the most important results in measurement de- 
pendence in Bell tests have been obtained for Santha- 
Vazirani sources [HI [131 (HI- These results show that 
there is a real advantage in considering Bell-based ran- 
domness, because it overcomes no-go theorems for clas- 
sical information. 

Finally, let us focus on distributions that are indepen- 
dent and identically distributed (i.i.d.) such that 

N 

p(Z = z|A = A) = nP(^i = ^i|Ai). (11) 



This can also be viewed as a block min-entropy source 
where each block consists of only one symbol, Zi. In this 
case, the Santha-Vazirani definition implies: 



Vn 



<p{z\\) <Pr, 



(12) 



We will use a different notation such that Pmax = Pm 
and Pmin = Pm to make clear that we are in the i.i.d. 
scenario. Then the definition of uniform min-entropy 



sources is equivalent to the figure of merit of measure- 
ment dependence used in [12], namely 



Pm := maxp(z|A) , 



[i.i.d.] 



(13) 



since H^i^{Z\A = X) > k for all A is equivalent to Pm < 
2~^. In the following, we will use these two figure of 
merits interchangeably for i.i.d. models. 

Instead of bounding the largest probability, the small- 
est probability also gives information on measurement 
dependence, as first proposed in [23J: 



m.mp{z\X) . 

z,X 



[i.i.d.] 



(14) 



If only Pm is explicitly bounded, then a bound on P^ 
can be inferred, however, it might be trivial, since it can 
be negative: P^ > 1 — {d — 1)Pm- Bounding only the 
min-entropy of the input seed to the Bell test, or equiva- 
lently bounding only Pm, which is the guessing probabil- 
ity, allows much different worst-case behavior in Bell tests 
than when the Santha-Vazirani definition is adopted, as 
we shall now explore. 



IV. LOWER BOUND FOR MIN-ENTROPY 
SEEDS 



We will be dealing with a K-partite Bell scenario where 
the i^^ party has rrii > 1 measurement settings {mA^TriB 
for bipartite) and each setting has an arbitrary number of 
outcomes. The joint configuration of settings z = zi...zk 
with Zi G {1, ..rrii} (z = xy for bipartite) is a i^-tuple in 
the set of all settings S of size Yli^i rrii. In this Section, 
moreover, we consider a Bell test in which the observed 
statistics of the settings follow a uniform distribution, 
that is 



ff„,in(Zi,...,ZK) = iVlog|5|, 



or equivalent ly 



K 



Pohs{zi...ZK) := ^p{zi...zk\X)p{X) = {Y[ 



rrii 



(15) 



.(16) 



i=l 



This is not an assumption like those on the nature of the 
seed: Pobs is observed in a realization; but it is a fre- 
quent working assumption for theoretical works, which 
was made in all previous works on measurement depen- 
dence. In Section [V| we shall see that a non- uniform p^bs 
has interesting consequences in studies of measurement 
dependence. 

We are presently able to discuss our main result: a 
lower bound on the min-entropy of the seed, below which 
no conclusion can be drawn from any Bell test, unless 
further structure is assumed. 



A. Reaching the no-signaling limit 

The main insight is provided by the fohowing Lemma, 
which we present in the bipartite scenario (the gener- 
ahzation to multipartite scenarios holds with identical 
proofs and more cumbersome notation, so we give it in 
Appendix [A| : 

Lemma 1. Let P{ab\xy) be an arbitrary no-signaling 
distribution with x G {1,...,7ti^} and y G {1, ..., ttib}- 
For any pair of settings {x^y), there exists a local distri- 
bution PL{ab\xy) such that 

PL{ab\xy) = P{ab\xy) (17) 

for{x,y) e S^,y = {{x,y),{x\y):xe{l,...,mA}, 

y' e {l,...,mB}}. 

Moreover, this result is tight: if another pair of settings 
is added to the subset of pairs, there exists a no-signaling 
point for which those probabilities are nonlocal. 

Proof The proof can be done by constructing explicitly 
one such local distribution. Let us fix (x, ^) = (1, 1) with- 
out loss of generality. From the no-signaling distribution 
P, we construct 

P(ai,a2, ...,a^^; 61,62, ••-,6^^) 

rUA TUB 

= P{a^)P{br\ar) l[P{aj\bi) H^l^^l^i) (18) 

j=2 k=2 

with obvious notations. This is a valid joint proba- 
bility distribution over the outcomes of all the mea- 
surements. Now, on the one hand, the marginals 
P{aj]bk) = PL{ci^b\j^k) define a local distribution, as 
first proved by Fine [24 . On the other hand, it is 
easy to show that P(ai;6/e) = P(a, 6|1,A:): one should 
sum first over all possible values of a2,...,a^^ to find 

P(ai;6i,62,...,6mB) = P{^i) nr=i ^(^^l^i)' after which 
the sum over the 6's is obvious. Similarly one proves that 
P(aj; bi) = P{a, 6|j, 1). So indeed we have a local distri- 
bution that mimicks the initial no-signaling one on the 
desired subset of pairs of settings. 

As for the tightness, suppose that we add a single pair 
of settings, say (2,2), to 5i,i: there exist no-signaling 
points for which CHSH is violated by the settings (1, 1), 
(1,2), (2,1) and (2,2); so those statistics can't be mim- 
icked by a local distribution. D 

Now we can state the main theorem: 

Theorem 1. Consider a min-entropy seed with an ob- 
served min-entropy i^min(XY) = N\og{mATnB) for an 
N-run bipartite Bell test with tua inputs on Alice, ttib 
inputs on Bob and arbitrary alphabets for the outcomes. 
If 

if^in(XY|A) < iVlog(m^+mB-l) (19) 

no conclusion can be drawn from the Bell test, since the 
no-signaling limit of the inequality can be reached with 



local distributions. The generalization of this result to 
K-partite Bell tests reads 



K 



i7^in(Zi...ZK|A) < Anog(^mfc-i^ + l).(20) 



/c=l 



Notice in particular that, without further assumptions, 
any source of randomness with H^i^(KY\K) < N log 3 
is useless as a seed for any Bell tests. 

Proof. We will construct an explicit i.i.d. seed which al- 
lows the faking of a Bell violation up to the no-signalling 
bound with appropriate local resources. From Lemma [l] 
we know that there exist subsets Sx^y of niA + tub — 1 
pairs of settings, for which no difference can be seen if 
a local distribution is substituted for a possibly nonlo- 
cal no-signaling point: in particular, this could be the 
no-signaling point that reaches the no-signaling limit for 
the inequality under study. If iJmin(XY|A) is sufficiently 
low, the seed will allow only the pairs of settings that be- 
long to one of the Sx^y and distribute the corresponding 
local strategy Xx.y- The seed 



p{xy\\x^y) 



TnA+niB — '^ ' 



0, 



\ix,y e Sx,y 
otherwise 



(21) 



has Pm = 7^ T in each run, whence we have proved 

the bound ([T9| as long as we can find p{Xx,y) such that 
T^x,yPi^y\^x,y)p{><x,y) = Pobs{xy) foTsllx.y. In the casc 
where Pobs is uniform, this can always be found by simply 
choosing uniformly the pair (x,^), i.e. p{Xx^y) = — - — • 
This concludes the proof for the bipartite case. The proof 
of the multipartite case is identical using the material of 
Appendix |AJ The final remark of Theorem [T] stems from 
the fact that each Bell test much involve at least two 
parties and each must have at least two settings. 

D 



Because of the tightness of Lemma IT] the bounds (19) 
and ( 20 ) are the best inequality-independent bounds that 
one can obtain with i.i.d. seeds. Moreover, since there 
exist inequalities for which the quantum and the no- 
signaling limits coincide, the bound to reach the quantum 
limit cannot be better. If the inequality is given, however, 
much less measurement dependence may be sufficient to 
reach the no-signaling limit, and even less to reach the 
quantum limit if it is lower. We elaborate further on this 
point in the following paragraph. 



B. Inequality-dependent bounds 

Let B define a Bell inequality, whose local, quantum 
and no-signaling limits are given by Pl < ^q < ^ns, 
and S^ be the set of settings that are used by the 
Bell inequality [35^. Again, for each A there is a lo- 
cal strategy for assigning outputs such that in order to 
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FIG. 2: (color online) Seeds t hat reach the critical min-entropy bound for uniform observed distribution of settings. For the 
inequality independent bound (20), the seed is uniform on ttia -\-mB — 1 settings and is zero elsewhere. For a given inequality, 
the no-signaling limit may be reached with a seed that is uniform on a larger number of settings |<S^|, and still zero on the 
others; in order to reach only the quantum limit, one can allow the settings in S^ to be used sometimes. Of course, for each 
A, the settings that are chosen may vary. 



achieve the no-signaling limit, some settings will be in- 
compatible with this strategy and must be hidden by 
measurement dependence. Let this set of inputs be 
S^{X). Then, an arbitrary no-signalling point is required 
to be compatible with a local point only on the subset 
S^{X) := S^ \ S^{X). Suppose an inspection of the in- 
equality B shows that at most \S^\ of these \S^\ settings 
must be hidden for any choice of A. Once the probabilities 
of the settings S^{X) are set to zero, the min-entropy is 
maximized by the uniform distribution over the remain- 
ing \S^\ settings (FIG. p|. However, one must be very 
careful to show the existence of p(A) which satisfies (16). 
Whenever such a distribution exists, if Pm ^ l/l*^^!? the 
non-local game can be won with probability one with lo- 
cal strategies. As implied by the results of the previous 
section, if the observed input distribution Po55(z) is uni- 
form then a strategy in the form of equation (21) or its 
generalization in Appendi x [A| with a uniform probability 



do better in some cases 



over A will always satisfy (16). However, it is possible to 

^here \S^\ < \S\. In such cases 



\S^\ (the most settings that must be hidden for any A) 



can be small. Here, for a uniform pobs equation ( |16| ) will 
also be satisfied provided possibly more settings than re- 
quired are hidden for each A such that |5^(A)| = \S^\ 
for all A and if C{z) is the set of As for which z G S^{X), 
\jC{z)\ = \S^\ must be constant for all z. This is a sym- 
metry condition that can be met by many Bell inequali- 
ties. As before, the existence of this example proves that 
a min-entropy seed with 



k<Nlog{\S^\) 



(22) 



can reach the no-signaling limit of B with local strategies 
for uniform input distributions. In the following section 



we will show how to obtain bounds for arbitrary Pobs and 
that approach will also give tight bounds and optimal 
strategies when the inequality is one in which the size of 
the "hidden sets" varies with A. 

Further, if Bq < Pns, in order to simulate physics 
one may be content with reaching the quantum limit. A 
possible i.i.d. seed (not proved to be optimal) is the fol- 
lowing (see Fig. [2]). With probability 1 — <7, the settings 
are chosen uniformly among all A^ possible if-tuples: 
this is measurement independence, so B < Bl on these 
cases, and the physical process A can be chosen as one of 
those that saturate B = Bj^. In the other instances, the 
settings are chosen uniformly in S^ and the physical pro- 
cess A is chosen in each case in order to achieve B = B^s • 
In other words, this seed is a convex combination of the 
measurement independent uniform seed and the seed de- 
scribed in the previous paragraph. Note that this new 
seed will automatically satisfy the constraint (16). For 



such a seed, therefore, Pm is the probability of each set- 
ting in 5f , which reads Pm = ^ [l + ^ {\S\/\S^\ - l)] • 
With this measurement-dependent strategy, one can 
reach B = qB^s + (1 — q)Bl^ so B > Bq for q > 
{Bq — Pl)/(Pns — Pl)- In summary, the quantum limit 
can be achieved with an i.i.d. seed with 



dB,Q 

M 



> 



1 

W\ 



Pns - Pl 



{\s\/\sl 



(23) 



that is, a min-entropy seed with k < —N logP^ can 
reach the quantum limit of P with local strategies, for a 
uniform input distribution. 

Let us illustrate the methodology with the analysis of 
some inequalities: 



CHSH: here, it is always necessary and sufficient to 
hide one pair of settings. Therefore \S?\ = 3 and 



V. 



the inequahty-dependent bound (22) is the same 
as the inequahty-independent one (19) to reach the 



no-signahng limit, as already proved in [12j. Recall 
that this does not prove the bounds to be tight, 
because they are based on explicit i.i.d. seeds: non 
i.i.d. seeds may lead to tighter bounds, though 
we do not know any example. As for reaching the 
quantum limit, we have Pj^ = |[1 + (a/2 — l)/3] ^ 
0.2845. 

• Chained inequality: here again, as we have seen 
in paragraph |III A[ it is always necessary and suf- 
ficient to hide only one pair of settings out of 



M 



m" 



so |5f 



m" 



1 and |5f (A)| 



for all A. As a consequence, in terms of min- 
entropy, the inequality-dependent bound (22) is 
N \og{m'^ ~ 1)7 which is approximately twice the 
value N log(2m — 1) obtained from (19). For large 



m, the quantum and no-signaling limits basically 
coincide. 

• CGLMP inequalities: like the CHSH inequality, 
the CGLMP inequalities are two party inequalities 
where each party has two inputs. However, this 
family of inequalities has d possible outputs for each 
party. In the quantum case, the CGLMP inequali- 
ties can provide more robustness against measure- 
ment dependence than the CHSH inequality, in 
the sense that the min-entropy of the inputs given 
the source must be lower if the quantum bound 
is to be achieved. The reason is that it has been 
shown that as (i ^ 00, the quantum limit increases 
and approaches the no-signalling limit (25] [26] . As 
can be seen, inspecting equation (23), the value of 
{Bq—Bi^)/{B^s — Bl) will increase with d, and the 
value of Pm necessary to reach the quantum limit 
with local resources increases, until it reaches the 

B,NS 



no-signalling value P^' in the limit 



THE POSITIVE EFFECT OF BIASING THE 
CHOICES OF THE SETTINGS 



Theorem 1 shows that assuming a full min-entropy 
source on the measurement settings, for any meaning- 
ful conclusion to be drawn from a Bell test, it must be 
that i;frnin(XY|A) > N \og{mA + rriB — 1). However, 
recalling that the role of the observed data is actually 
a constraint imposed on the underlying model (similar 
to equation (16)), we can hope to use it to our advan- 



tage. This motivates the question: for a given value of 
i:fmin(XY|A) = Nk> N \og{mA^mB-l) that is being 
assumed, what is the optimal distribution on the inputs 
such that the maximum possible Bell value obtainable 
with this degree of measurement dependence and only 
local resources is as low as possible. Because the situa- 
tion for non i.i.d. models is intractable, we are restrict- 



ing ourselves to the i.i.d. model for the remaining of this 
chapter. Here instead of the min-entropy, the guessing 
probability Pm is used exclusively as the figure of merit of 
measurement dependence. First, we consider the CHSH 
inequality as an explicit example. 



A. The CHSH Inequality 

Intuitively, we expect that the optimal solution is to 
set for each input round H^i^{XY) = H^i^{XY\A) = k 
and Pobs{xy) = 2~^ for three pairs (x, y) and Pohs{x'y') = 
1 — 3 X 2~^ for the final pair because in this case A cannot 
contain any further information on XY than is available 
simply from observing the distribution Pobs {xy) - We will 
highlight an example of this type of distribution later 
in this section. This is not a uniform distribution, so 
we can already see that non-uniform input distributions 
can be beneficial. In this section, we will consider fixed 
input distributions Pobs{xy) and find the maximum value 
that the CHSH inequality can take given a bound on 
Pm- Note that the method in this section extends to any 
multipartite Bell inequality. 

We want to find the violation ^chsh' under local re- 
sources and measurement dependence, as a function of 
Pm and Pobsixy)- To this end, observe that the local 
distributions form a convex polytope and so is the set of 
seeds with a fixed value of Pm (the seed polytope). Us- 
ing the decomposition into extremal points of a convex 
polytope, we have 

p{ab\xyX) = ^ai{X)ei{ab\xy) , (24) 



p{xy\X) = ^ft(A)/,(xy), 



(25) 



where ei{ab\xy) are the extremal points of the local poly- 
tope and fj {xy) are the extremal points of the seed poly- 
tope. Now after multiplying by both sides by Pobs{xy), 
the i.i.d. model with measurement dependence becomes 



p{abxy) = / dX^ai{X)l3j{X)ei{ab\xy)fj{xy)p{X) 
^^ ij 

= ^lij9ij{o.^xy) , (26) 

where 

jij= f d\ai{\)Pj{\)p{\), (27) 

JA 



gij{abxy) = ei{ab\xy)fj{xy) . 



(28) 



In this notation, the problem becomes a linear program, 
i.e. finding 

ScfilH(^M,Po6.(xy)) = max ^ (-!)»+''+-. Ji^ 

(29) 



subjected to the constraints 

p{abxy) = ^jijgij{abxy), ^p{abxy) = Pobs{xy) 



ab 



(30) 
for known values oi Pohsi^v) and Pm- The result is pre- 
sented in FIG. [3 



- Non-product: p(00)=0.29, p(10)=p(11)=0.29,0.28,0.27 

- Product: p(00)=0.27, p(X=0)=0.50,0.51,0.52 

- Uniform: all 1/4 




0.25 0.26 0.27 0.28 0.29 0.3 0.31 0.32 0.33 



FIG. 3: (color online) Plot of maximuni CHSH value against 
measurement dependence Pm for different Pohs- Notice that 
Pm start from max{poo,Poi,Pio,Pii} because of the data pro- 
cessing inequality: no underlying model of smaller Pm can 
reproduce the observed input statistics. In a Bell test with 
an assumed dependence bound Pm^ if the value of the in- 
equality is above the line that corresponds to the observed 
input distribution pohs then there is intrinsic randomness in 
the outcomes contributed by A. Therefore, for some observed 
violations, biased settings statistics can allow the certification 
of intrinsic randomness while uniform statistics cannot. 

Using the numerical results, it is easy to see that the 
optimal strategy for maximizing the Bell value ^chsh 
whether or not the observed distribution is uniform is to 
choose 



p{xy\X 



■.y) - I 1 



Pm 
-SPm 



ifx.ye Sx,y 
otherwise 



(31) 



for each A^^^, where Sx,y is defined in equation ([T8|. 
Choosing this strategy, it is straightforward to find an 
analytic expression for ^chsh- 



^chsh(^m) = 4- 



{l-3PM)q- 



l-3Pj 



M 



where for convenience we define q := 



4Pm 



1 



{q - 16) 



i This 

expression is only valid for mdiXx ^yPobs{xy) < Pm < ^ 
(H^'ni^y) > ^min(^^IA) > logS). Noticc that when 
the distribution is uniform q = 16 and the second term 
vanishes, leaving a linear expression in Pm- 

It is interesting to observe that for the purpose of 
violating Bell inequalities (that is, demonstrating non- 
locality by exceeding B^^^) under measurement depen- 
dence, suppose the inputs have privacy quantified by Pm^ 



then it is advantageous for us to purposely select an input 
distribution that is not uniform. This can be seen easily 
from for example the red curves for Pm = 0.27: select- 
ing uniform input distribution allows a violation up to 
about 2.5 while selecting non-uniform input distribution 
only allows a lower maximum violation! Note that for 
non-uniform distributions on the inputs the upper bound 
on the Bell value is only as low as 2 (the local bound 
assuming measurement independence) for non-product 
distributions on the inputs. (See the blue curves.) All 
non-uniform product input distributions can have Bell 
values larger than 2, if measurement independence is re- 
laxed. Notice also, that the lowest blue curve, the one 
that takes the value 2 at Pm = 0.29 is the one corre- 
sponding to the distribution [p(00),p(01),p(10),p(ll)] = 
[0.29,0.13,0.29,0.29]. This is precisely the form of the 
distribution on the inputs we has anticipated at the start 
of this section. 



B. Generalizations 

We have seen that for the ca se of the CHSH inequality, 
the strategy outlined in section IV in equation (21 ) is the 



optimal strategy even in the case that the distribution 
Pobs{xy) is not uniform. In general however this is not 
the case. It is possible to find some inequalities that 
together with some distributions Pobsi"^) do not admit a 
strategy of the form 



p{xy\Xz) 



Pm, if z g 5z 
Q(Az), otherwise 



(32) 



where Q{X) is determined by the normalization condition 
to be Q(A,) = i^lf^. 

Let us limit our focus to inequalities with symmetries 
such that l^f (A)| = l^fjand \S^{X)\ = \S^\ for all A. 
In that case, equation (IT6| can be written as a matrix 
equation, with Po6s(z) and p{X) written as vectors and 
p{z\Xx^ is a matrix whose entries are defined by equa- 
tion (32). If the p{z\X) matrix is is full-rank, then there 



is a unique solution for p{X) that is a valid probability 
distribution. This will always be the case ii\Sg\ and \Sh\ 
have no common factors. 

Examples of cases where the sizes of the sets Sg and Sh 
have no common factors are any bipartite Bell inequality 
with terms for all input pairs present and where both 
parties have the same number of inputs. For these cases, 
the min-entropy bound of section |IV| also applies for any 
non- uniform observed distribution on the inputs. 

If, for a given inequality, l^*^! and \Sh\ have at least 
one common prime factor, there may be some choices of 
distribution Pobs foi" which the strategy ( 32 ) will not be 



able to reproduce Pobs with any valid distribution p{X). 
In that case, the optimal strategy may have to be found 
numerically. For the i.i.d. case, one do this by solving 
a linear program that is a generalization of the one pre- 
sented in the previous section. 



VI. BOUNDS ON THE ACHIEVABLE 
DISTRIBUTIONS 

While the set of distributions obtainable from a mea- 
surement independence local model is the local polytope, 
that obtainable from a measurement dependence model 



is in principle a larger set. The example in section JIB 
makes it clear that this set can even include signalling 
distributions. Now we wish to study more carefully this 
new set of distributions. As mentioned in |II A[ figures of 
merit of measurement dependence can be defined as re- 
strictions on p{X\xy) instead ofp{xy\X). It turns out that 
in characterizing the set of achievable distributions, it is 
easier to work with figures of merits based on p{X\xy). 
In general, specifying p{xy\X) does not specify p{X\xy) 
unless p{xy) and p{X) are uniform, in which case the two 
are proportional. Among the figures of merit one can 
define is the one by Hall [10] 

Definition 3. Hall's Measurement Dependence. The 
quantity M quantifies the change to the probability dis- 
tribution over a random variable A given X = x^Y = y 
versus the distribution given X = x' ^Y = y' : 



M := max 

x,x' ,y,y' 



\p{X\x,y)-p{\\x',y')\d\. (33) 



M can range from 0, the measurement independence 
case, to 2, corresponding to the probability of a given 
value of A being zero for at least one of the input pairs 
{x,y) and {x\y'). 

An alternative figure of merit that we will use in this 
section is the following 

Definition 4. The quantity M' bounds the distance be- 
tween the probability distribution over the random vari- 
able A given access to x^y and the distribution over A 
without information on x^y: 

M' := max2D(p(A|A = x,Y = y),p{A)) , (34) 

X ,y 

withp{A) = T.x,yPiM^ = x,Y = y)pobs{X = x,Y = y) 
and D{-^-) is the total variational distance. 

Essentially this definition bounds the distance of any 
one element in the distribution away from a strategy in- 
dependent of Alice and Bob's inputs x and y^ which is 
similar in fiavour to a Santha-Vazirani bound. 

Now we wish to compare the point obtainable from a 
measurement dependence model with local resources, 

Pab\xy = Y.p{a\xX)p{b\yX)p{X\xy) , (35) 



to its corresponding measurement independent point, 

Pab\xy = ^p{a\xX)p{b\yX)p{X) . (36) 

A 



Their distinguishability is characterized by their total 
variational distance 

^iPAB\xy^PAB\xy) 



9 Z^ \Pab\xy Pai 



b\xy\ 



a,b 



iE 



^p{a\x\)p{b\y\) {p{X\xy) - p{X)) 



^ In [^p(a|xAM6|yA) ) \p{X\xy) - p{\)\ 

X \a,b J 

= D{p{X\x,y),p{X)) 

< M'/2, (37) 

where we have applied the triangle inequality. In other 
words. 



\jrab\xy 



' Pab\xy 



<M' Vo,5, 



,x,y, 



(38) 



which gives a bound on the distributions that can be 
created with a measurement dependence model of de- 
pendence bounded by M^ 

There is another model that relaxes assumptions about 
correlations between two parties: quantum "cross-talk" 
between non- fully-isolated devices [27]. In this case, the 
assumption that the measurement operators are in a ten- 
sor product, Ma^Mb^ is relaxed. Of course in this case, 
it is possible to use the measurement itself to introduce 
quantum correlations between the two parties. In [27] 
is is assumed that the source of the measured state dis- 
tributed to Alice and Bob is not subject to the cross-talk. 
The model was proposed to describe excess correlations 
that could result from a pair of trapped ions measured 
while situated next to each other in a refrigerator after 
being entangled. 

As introduced in [27], let us consider the cross-talk 
model bounded by x 



Xl^ inf {Uab 



nA0nB)^xi, (39) 



where H/^,Hb, and Uab are POVM elements. As noted 
in |27j, the bound ([39| implies 



\Pab\xy Pab\xy\ ^ X i 



(40) 



where the prime indicates the distribution resulting from 
non-zero cross-talk. Comparing this with equation (38), 



it is clear that a cross-talk model bounded by x implies 
the same bounds on the deviation of the distributions 
using the extra correlations (cross-talk) as the measure- 
ment dependent model when bounded by M' = x- 



VII. CONCLUSIONS 

Bell tests are an essential tool in device-independent 
approaches. They rely on a set of reasonable assump- 
tions, but some of the assumptions are untestable. In 
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particular, the correlations between source and settings 
are strictly unobservable and therefore the amount of re- 
duction of measurement independence is ultimately an 
assumption, either on the power of an adversary, on 
a physical model for the experiment. This study has 
demonstrated that when relaxing this assumption, the 
definition used, be it min-entropy or a Santha-Vazirani 
condition, is critical with respect to what kind of guar- 
antees can be obtained from a Bell test. There are re- 
sults [13, 28 showing that with a Santha-Vazirani source 
assumption arbitrarily weak randomness can be ampli- 
fied using a protocol that checks for the violation of a 
Bell inequality. This cannot be accomplished using a 
min-entropy condition, as we have demonstrated in sec- 
tion |IV[ for sufficiently low min-entropy any inequality 
can be violated up to its no-signalling bound, using only 
the classical measurement dependent correlations and in 
a way that a third party could predict all of the outcomes 
of the measurements. Even for the protocol in [11 that 
amplifies bounded randomness (in the Santha-Vazirani 
definition) using violations of the chained Bell inequal- 
ity, in order to get perfectly free bits out, the number 
of outputs for this inequality must go to infinity. As we 
point out in section III A[ in this limit the chained Bell 
inequality is not robust to any relaxation of input ran- 
domness if the min-entropy definition is used instead. 

The bounds on the min-entropy presented in section IV 
give immediate bounds for any inequality on the amount 
of input randomness required to draw conclusions about 
whether the violation of a Bell inequality can give any 
certification of quantum or non-local behavior. The 
method demonstrated for the CHSH inequality in sec- 



tion |V| demonstrates how to get tight upper bounds for 
the value a given Bell inequality can take assuming a min- 
entropy bound for any distribution over the measurement 
settings. It also shows that there may be advantages 
to deliberately choosing non-uniform distributions over 
measurement settings in device independent protocols, 
depending on what assumptions are being made. Relax- 
ing the assumption of measurement-dependence increases 
the set of probability distributions {p{ab\xy)} that can 
result from a Bell test assuming a local, realistic hidden 
variable model and section [Vl| gives an expression bound- 
ing this increase. 

Being as the assumption of measurement independence 
cannot be confirmed, it is important to understand the 
consequences for device independent protocols when it is 
relaxed. It is especially interesting that the min-entropy 
condition, a condition widely adopted in classical secu- 
rity studies [m [29ti3T] . is has such a different behav- 
ior from the Santha-Vazirani condition for these device- 
testing purposes. We hope that the bounds and charac- 
terizations provided here will be useful for constructing 
protocols that are more robust to extraneous correlations. 
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tribution but a notation for a procedure that picks up 
the maximum of a probability distribution. As an ex- 
treme example, if Z looks uniform but the knowledge of 



Ea ■P(A = A)Pguess(Z|A = A) = 1. 

The chained Bell inequality is an example of an inequality 
whose value depends only some of the possible inputs. 
Only terms such that x = yoix = y+l and the term 
for X — 1, y — m appear. 



Appendix A: Generalization of Lemma [T] the 
multipartite case 



A determines z uniquely, one has Pguess(Z) 



and 



The bound (20) for the min-entropy in a multipartite 
scenario is based on the generalization of Lemma [T] that 
we provide here: 

Lemma 2. Let P(o|z) he an arbitrary K -partite no- 
signaling distribution with z = (2:1, ..., z^) where zi G 
{1, ...^vfii) and o is a K-tuple of outcomes. For any K- 
tuple of settings z = (^i,...,^k); there exists a local dis- 
tribution Pl{o\z) such that 

Pl{o\z) = P(o|z) (Al) 

z[ e {l,...,mi}} . 

Moreover, this result is tight: if another K-tuple of set- 
tings is added to the subset S^, there exist a no-signaling 
point for which those probabilities are nonlocal. 
Proof. Again, let us fix z = (1, ..., 1) without loss of gen- 
erality. Let o\. be the zth party's outcome given the z^th 
measurement setting. From the no-signaling distribution 
P, we construct a valid probability distribution 



PK...oi,^;...;of...o^J = P{o\) 



K 



Wp{o\\o\...c^,-') 



K rrii 
i=lj=2 



i-lj+l 



...of) 



(A2) 



whose marginals P(oJ^; ...; o^) = Pl{o^ •.•o^\zi...Zk) de- 
fine a local distribution by Fine's result [24 . To ver- 
ify that we have a local distribution that mimics the 
initial no-signaling one on the desired subset of pairs 
of settings, consider this example: for the input string 



(1,^2, ...^/c) with the distribution V{o\',dl^',...',of^) = 
Pl(o^o^...o^|1, ^2, ...^fe) we sum first over all possible val- 
ues of each outcome variable 02, ..., o^^ to find 



K 



P(ol;o?...o^^;...;of...o^J = P(o{)n 



P{o\\o\...o\-')llP{c 



i=2 



A-li+l 



...of) 



(A3) 



after which continue to sum over all the o^, ..., o^ except o^^ ..., of and one is left with a probability distribution 
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P(o\,ol 



...o 



K 



) on only K variables, one for each party. a given hidden variable value (see equation (20) in the 



The other verifications are similar. Another way to think 
of it is to notice that each conditional probability factor 
on K variables (one variable conditioned on i^ — 1 other 
variables) effectively sets a joint probability distribution 
on those same K variables. In the distribution (A2) there 



^K 



are X^^^^ rrii — K ^ 1 such factors and so this is exactly 
how many local points Pl{o\z) that can be matched for 



main text). The argument for tightness still works if we 
consider only two parties among K. For any two parties 
we can choose a pair of inputs for each to return to a 
CHSH-type scenario, then the argument follows in the 
same way as in the proof of Lemma [T] 
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